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Fraunhofer FIRST develops a computing service and collaborative workspace providing a convenient tool 
for simulation and investigation of quantum algorithms. To broaden the twenty qubit limit of workstation-based 
simulations to the next qubit decade we provide a dedicated high memorized Linux cluster with fast Myrinet 
interconnection network together with a adapted parallel simulator engine. This simulation service supple- 
mented by a collaborative workspace is usable everywhere via web interface and integrates both hardware and 
software as collaboration and investigation platform for the quantum community. The modular design of our 
simulator engine enables the application of various implementations and simulation techniques and is open 
for extensions motivated by the experience of the users. The beta test version realizes all common one, two 
and three qubit gates, arbitrary one and two bit gates, orthogonal measurements as well as special gates like 
Oracle, Modulo function and Quantum Fourier Transformation. The main focus of our project is the simulation 
of experimentally realizations of quantum algorithms which will make it feasible to understand the differences 
between real and ideal quantum devices and open the view for new algorithms and applications. That's why 
the simulator also can work with arbitrary Hamiltonians yielding its unitary transformation, spectrum and 
eigenvectors. To realize the various simulation tasks we integrate various implementations. The test version 
is able to simulate small quantum circuits and Hamiltonians exactly, the latter through the use of a standard 
diagonalization procedure. Circuits up to thirty qubits can be simulated exactly as well; Hamiltonians of 
that size, however, have to be approximated according to the Trotter formulae. For a restricted gate set we 
also develop a tensor-sum implementation, which makes it feasible to investigate circuits with up to sixty qubits. 

PACS numbers: 03.67.-a, 03.67.Lx, 02.70.-c, 02.70.Hm, 03.67.Mn, 07.05.Tp, 07.05.Wr 



I. INTRODUCTION 

Classical simulation of quantum processes seams like a 
Don Quixote enterprise. To simulate 31 qubits we need 32 
GByte of memory, and every additional qubit will double the 
required resources: time, memory, power and space. Even the 
biggest supercomputers, if we could use them for quantum 
gate simulations, will give up around the forty qubit limit. 
This not only shows the suiTender of the classical simula- 
tion, it also gives an impressive illustration of the exponential 
power of a quantum computer. So why should we make an 
effort in classical simulation of quantum systems? 

There are some reasons, and the most important one may 
be: to get the knowledge for building a useful quantum com- 
puter today, not tomoiTow. Realized quantum computers are 
very small (around seven qubits). Our quantum computing 
simulator is able to simulate 3 1 qubits in an easy-usable, re- 
producible way without the obstacles of experimental setups. 
The quantum computing community need not to wait anymore 
for the next generation quantum computer - new algorithms 
and ideas can be tested today. 

Another good reason is the possibility of using the quantum 
computing simulator in a way like a chip simulator in semi- 
conductor industry: as an useful tool in the first stage of the 
design process of new circuits. With its help it is possible to 
consider all conceivable quantum devices without a restriction 
to the experimental realizable ones. This opens the view for 
new ideas and concepts usable to invent new algorithms. 

An important point is that simulation makes it possible to 
compare ideal quantum circuits with their experimental re- 



alizations. In an experiment the ideal dynamics of any real 
quantum device is disturbed by errors and decoherence. A 
simulator can describe both the ideal dynamics and its errors 
and decoherence effects. The non-ideal effects can be added 
in a controllable way which gives us an offer to test modifi- 
cations of ideal algorithms improving their feasibility to run 
under real world conditions. 

Finally, a quantum simulator may also be considered as 
an educational tool. Quantum mechanics is very demanding 
for human conceivability, yet it is the fundamental key for 
development and use of quantum computers. Anything 
which makes the processes of quantum computing more 
comprehensible will promote the development of this new 
kind of information processing. The visualization of the 
quantum computing processes is an important step to improve 
its public understanding. But not only the public image of 
Quantum Information Processing will get more lucidity, the 
simulator shows the computer scientist how quantum waves 
and particles process information and it will help the physicist 
to learn that quantum mechanics can be used for more then 
the mere description of the material world. 

These arguments illustrate the usefulness of quantum sim- 
ulations and lead to the question: What software concept 
should we use to realize it? There are two contradicting re- 
quirements to regard. At first, classical simulation of quan- 
tum circuits requires high memorized hardware. A reasonable 
compromise between qubit size and costs is a multiproces- 
sor cluster with standard boards. This induces the fact that 
the implementation of the simulator has to be parallel, which 



deeply restricts the group of potential users. On the other 
hand, only an easily accessible simulator without special hard- 
ware requirements would be a practicable and useful tool for 
the quantum computing community. The contradiction be- 
tween hardware requirements and public availability can be 
solved by the software concept of an web-based computing 
service. 

This service is accessible from everywhere by everybody 
with a standard web-browser The user draws up a simulation 
task with help of a small browser applet and sends this 
simulation job to a server. This server can be equipped 
with all necessary parallel hardware and dedicated parallel 
simulator engines. By this concept we get two advantages: 
The simulator implementation can be optimally adapted to 
the hardware employed, and there is no need to make it 
compatible with diverse user platforms. The user is freed 
from any installation, administration and support tasks which 
could be slightly complicated in case of parallel hardware. 

The Fraunhofer Quantum Computing Service provides a 
dedicated 56 GByte RAM Linux cluster with fast Myrinet in- 
terconnection network enabling simulations up to 31 qubits. 
We have supplemented the simulator by a collaborative 
workspace based on the Plone/Zope 1 1 ] content management 
system which opens the possibility not only to simulate prob- 
lems but also to exchange, publish or discuss simulation runs, 
documents and ideas within the user community. We use a 
modular software design for our simulator engine taking into 
account the diversity of implementations and simulation tech- 
niques needed for the simulation of quantum processes. Ad- 
ditionally, this modular concept makes the simulator easily 
adaptable and open for extensions motivated by the experi- 
ence of the users. 

The next section will outline structure and rationale of 
the architecture of the simulator Then we will give a short 
description of the physical background and numerical tech- 
niques used for the quantum simulations engines. The last 
section will discuss the potentials and future developments of 
the Fraunhofer Quantum Computing Services. 



II. SOFTWARE CONCEPT AND TECHNOLOGY 

The structure of the Fraunhofer Quantum Simulator is given 
by three main components: Web Interface, translator, and var- 
ious modular computing engines (see Figure^. 

• The Web Interface provides graphical editing of quan- 
tum gate circuits and interaction graphs, and handles all 
the administrative work. It has been written in Java. 

• For the various simulation services offered, there is a 
number of computing engines (see below). 

• The Web Interface contains a configurator, whose prin- 
cipal activity is to analyze the job submitted by the user 
so as to infer the appropriate computing engine, the 
memory demand, the number of processors, and the ex- 
pected computation time. 
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Figure I: Software architecture of the Fraunhofer Quantum Comput- 
ing Simulator. 



• The Web Interface communicates with the translator 
in both directions by files with QML texts (Quan- 
tum Markup Language). These files describe the jobs 
and their results, respectively, in a self-contained fash- 
ion. Syntactically, QML is an XML subset and hence 
human-readable; for more information see the online 
documentation. 

• Apart from the activities in the web interface, the exe- 
cution of a job consists of the following steps: 

1. The translator translates the QML input to a 
language-independent data structure (see below). 

2. The translator invokes the computing engine se- 
lected by the configurator, passing said data struc- 
ture. 

3. The computing engine performs the requested 
computations and writes the results to an output 
file. 

4. The translator constructs a QML result file for the 
web interface from the output of the computing 
engine and the original input file. 

All in all, the system makes the computing power of a parallel 
machine accessible at an ease of use similar to that of a pocket 
calculator. 

a. Input and output data. The data passed from the 
translator to the computing engine consists of an opera- 
tor/operand tree and administrative information. The tree de- 
scribes a concatenation of quantum gates as product of the 
corresponding operators on the state space. These gates com- 
prise "conventional" ones such as CNOT, Toffoli, etc., as well 
as measurement gates (probabilistic projection operators), and 
finally exponentials of subtrees that represent sums of Hamil- 
tonians. 

As to the output of gate simulations, after every time step 
(which may contain several gates as long as they do not act 
on common bits), the following information about the state s 
is made available: (i) The Bloch vectors, i.e., the scalar prod- 
ucts (s, a'^'^^s) for each ax, Uy, and cr^ and each i quantum 
bit; (ii) the amplitudes and phases of those base states whose 
amplitude exceeds a certain threshold; (iii) the entropy. 



Purpose 



Computing engine 



simulation of quantum gate circuits and 
Hamiltonians 



state representation by distributed tree, actions of small matrices only; 
approximation of Hamiltonians by Trotter- Suzuki formulae |2, 3] 



simulation of quantum gate circuits and 
Hamiltonians of limited size 



construction, diagonalization, exponentiation, and application of entire-matrix (e.j 
by Householder method |^) 



simulation of quantum gates circuits 
(no Hamiltonians) of larger size 



space-saving state representation through analysis of gate topology 



simulation of quantum gates 
(no Hamiltonians) of larger size 



approximated state representation by truncated series of tensor states 



computation of full spectrum of a 
Hamiltonian of limited size 



computation of margins of spectrum of 
a Hamiltonian 



construction and diagonalization of entire matrix 



Lanczos method |4] 



Table I: The computing engines 



b. The computing engines. The actual numerical work 
is done by the computing engines. Each computing engine 
solves a particular problem by a particular algorithm and 
data representation with its individual advantages. In other 
words, the computing engines populate a two-dimensional 
space whose dimensions might be called "problem classes" 
and "solution concepts". This set of computing engines is the 
place where future extensions are likely to be incorporated. 
Therefore, in order not to hamper creativity, no structure is 
preimposed on this set. Adding a computing engine essen- 
tially involves the following actions: 

• Programming the computing engine as a C++ class that 
meets a certain interface; 

• devising criteria when to favor this computing engine 
over the others, and incorporating these criteria in the 
configurator 

The computing engines that are currently available or un- 
der construction are sketched in Table HI They employ vari- 
ous state representation concepts and algorithms, which, for 
instance, exploit certain redundancies of the quantum gate 
topology or certain approximations of the state representation 
in order to make the simulation of larger systems possible. 
The details will be fixed and published later The parallel im- 
plementations use MPI. 

Flexibility is further facilitated by a the fact that the QML 
language can easily be extended with new element and at- 
tribute names, and that the internal intermediate data structure 
shields the computing engines from language idiosyncrasies. 



III. QUANTUM SIMULATION 

This section describes the physical background of the quan- 
tum simulations engines. The simulation of quantum systems 
is one of the most complicated problems in physics. To de- 
fine this problem, consider a time-dependent state |i/'(i)) in 
some Hilbert space. The dynamics of this state is given by the 
Schrodinger equation 



with Hamiltonian H. Methods to solve this equation emerge 
as important tools to simulate for instance molecular fy, ^ 
and nuclear collisions |7, 8, 9], atom-surface interactions 1 10], 
high-resolution electron-microscopy image simulation [ll], 
light propagation in optical fibers 1 12], electron motion in dis- 
ordered materials 1 13] etc. A general overview can be found 
in 1 14] for instance. The formal solution to equation Q is 
given by 



m)) = eM-^tH) IV^O)) 



(2) 



which is complicate to calculate. First we remark that the 
Hamiltonian H is an element of the Lie algebra of the au- 
tomorphisms of the Hilbert space, which is the unitary group 
U{n) for some natural number n including the case n ^ oo 
which is the dimension of the Hilbert space. Then the expo- 
nential is a map from the Lie algebra to the Lie group. Thus, 
the generator exp{—itH) of the formal solution generates for 
all possible Hamiltonians all unitary transformations, i.e., all 
elements of the group U{n). Therefore we obtain two pos- 
sibilities to generate unitary transformations which serve as 
quantum computing operations: by fixing a Hamiltonian or 
a unitary matrix. The so-called "Quantum Emulator" of De 
Raedt et.al. PIS'] is the most important example of a quantum 
computing simulator using a Hamiltonian for the computa- 
tion. 

Now we choose a 2-level system describing the qubit 



|^)=a|0)+6|l) \af + \bf^l 

which can concretely represented by the spin |0) = t 
|1) = |. Next we orient the spin in the z-direction, i.e. 
Pauli matrix 



1 
-1 



is diagonal. The other Pauli matrices are given by 
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Then the Hilbert space for N qubits is a 2^-dimensional com- 
plex vector space, and a state is the sum of tensor states. In 



our simulator we use the two possibilities described above to 
choose a unitary transformation. At first there is a library of 
quantum gates or unitary gates (CNOT, ToffoU, etc.) repre- 
sented by matrices, for instance 

10 
10 
1 
10 

is the CNOT gate. Furthermore there is a collection of scal- 
able gates like QFT, Grover Step, Oracle and the Grover gate. 
Among them there is a special gate, the EXP gate which is 
unitary gate by fixing the Hamiltonian. 

Before we describe the EXP gate, we have to fix some no- 
tation. Let ~d' = {ax, <Jy,az) be the "vector" of Pauli matrices 

(i) 

and o-x,y,z is the action of a Pauli matrix on the i-th qubit, i.e. 



r(*) 



(^'a=]'y,z = 1®1 ' 



^x^y,z ^ 



i-1 



N-i 



where 1 is the 2x2 unit matrix. For the Hamiltonian, we 
assume only nearest neighbor interactions, i.e. only 2-qubit 
interactions. That is, we have to fix two couplings matrices 
E^- \e\ for the 2-qubits interactions and for the 1 -qubit in- 
teraction with an external field, respectively. Furthermore we 
need the adjacency matrix Jij containing the interaction struc- 
ture of the qubits. Finally we obtain our Hamiltonian 



H 



E^4(^'^ 



i<3 



E^^U) 



£;fVw (3) 



where if denotes the transpose Pauli "vector" with the ob- 
vious rule 



E, 



(2) 



(^i?T 



To illustrate this, we write down the ordinary quantum Ising 
model, i.e. with cr^ ® cr^-coupUng and the external field into 
the cra;-direction. Then we have to fix the matrices for all com- 
binations of i,j to be 



where we have the problem to calculate the inverse 
2 + itH J which is also complicated enough. Here we 

choose another method know as Trotter-Suzuki formula ]tMM- 
Lets assume that the Hamiltonian is a sum of two terms 



and we obtain from the Trotter-Suzuki formula the approxi- 
mation 



{—itH\ « (exp(— iti/o/«)exp(— iti/i/n))'^ 



exp 



which is correct to order t. In fl61, De Raedt introduces a 
second and a fourth order refinement of this formula. The 
second order approximation is given by 



exp I —itH 



{cxp{—itHi / {2n)) cxp{—itHo/n) 
exp{~itHi/{2n))f 



correct of order t^. In our simulator we implement this ap- 
proximation whereas in a future extension we will also imple- 
ment the forth order approximation given by 

exp(-itH] « (exp(-iti/o/(2n))cxp(-iti/i/(2r7.)) 

exp{it^C/n^) exp{-itHi/{2n)) 
exp(-iti/o/(2n)))" 

correct of order t^ where the operator C is given by 

C=[Ha + 2H,,[Ho,Hi]]/2A. 

Of course we use these formulas recursively to calculate the 
full Hamiltonian (|3j- At the end of this section we will de- 
scribe the output of a simulation. In the current version of the 
simulator we have 3 kinds of output: the Bloch vector, a kind 
of entropy as well as the probability and phase of the most 
important base vectors. Lets consider a state 

1^) = E Q !</>,) 



(2) 
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with B as the external field and Eo as interaction energy. 

Now we are left with only one problem: the exponential 
of the Hamiltonian to get the unitary transformation related to 
the Hamiltonian. For small values of N, the number of qubits, 
one can calculate the spectrum of _ff to express the exponential 
according to the rules of linear algebra. But for higher values 
of N we use an approximation. One method is the so-called 
Cranck-Nicholson method where the approximation is given 
by 



exp 



(-uh) 



2 -itH 
2 + itH 



with base vectors \4>i) . Then the probability pi and the phase 
(fii of a base vector is given by 

Pi = |cip ipi = arg(ci) 

and we define the entropy S'di/') ) of ji/') to be 

M\Cr\') 



s{m = -Y.\ 



ln(2) 



The Bloch vector Ut for the i-th qubit is given by the expec- 
tation value 



llfW 



in the notation above. 



^JJl^l 





Figure 2: Simulation result for a 9 qubit Shor algorithm. 



Before we close this section we will remark that the Hamil- 
tonian above will be extended to more realistic cases like 
NMR by adding a periodic term in the matrix E^ . Further- 
more we remark that the structure of the Hamiltonian above 
includes also all interesting cases known from condensed mat- 
ter physics. There, the electron creation a+ and annihilation 
a operators are needed. With the settings a~^ = {ax — iay)/2 
and a = {dx + i(Ty)/2 one can formulate a substitute of the 
model in terms of the Hamiltonian (|3- 



Example: Shor algorithm 

Here we will describe the output of the Shor algorithm for 
the number N = 899 with the random number a — 11 (see 
Figure 13 ■ The starting point is the division of all qubits into 
two registers: the x-register \x) and the y-register \y) where 
the length n^ of the x-register is at least twice the length Uy ~ 
nx/2 of the y-register. In this case we need Uy = 10 qubits 
for the y-register encoding the number N = 899 and at least 
Ux — 20 qubits for the x-register giving 30 qubits for the 
whole circuit. Here we use the full capacity of the simulator 
with 31 qubits. At first (time step 1) we construct a Hadamard 



state on the x-register, i.e. 

|x)=(g)-=(|0) + |l)) 

i—l ^ 

Then we have the apply the MODULO Gate (time step 2) 

\x) (g) \y) ^ |x) (g) |(y ® (a"^ mod N))) 

with a measurement (time step 3) of the y-register afterwards. 
Now we have an entanglement between both registers, as can 
be seen by the vanishing of some Bloch vectors. Beginning 
with time step 4, the quantum Fourier transformation "re- 
moves" all unnecessary states by interference and we are left 
with one state with high probability and a lot of states with 
low probability. Thus after the measurement of the x register 
the algorithm is finished. A short look into the Bloch vector 
of the x-register shows the value M = 954733. A contin- 
uous fraction expansion leads to the fraction represented by 
[10, 5, 1, 3, 9, 1, 6, 3] and we obtain for the proposed value of 
the exponent r — 7. Only factors of this value are the solution 
and in our case 

11^1" mod 899 = 869 



Finally we are looking for the greatest common divisors (gcd) 
gcd(869- 1,899) = 31 

5cd(869+ 1,899) = 29 

and we obtain the factors 29 and 31 of the number 899. 

The Figure |2] visualizes the main base states and their 
phases as well as the entropy of the states of a 9 qubit Shor 
algorithm with a 6 qubit x register and a 3 qubit y register 
The problem is the factorization of iV = 6 with a random 
number a — 4. We have done the simulation also for a 31 
qubit Shor algorithm. The results look similar, but possess 
a huge number of base states, which are too many for a in- 
structive visualization. After the first time step the entropy 
increases to the size of the x register, and we obtain all possi- 
ble base states. Then the modulo gate mixes these states with 
the states of y register, and the measurement thins out some 
of them. Beginning with time step 4, the most interesting part 
starts - the quantum Fourier transformation. During this pro- 
cess the number of interesting base states decreases and we 
finish with only one state with a high probability and many 
states of low probability. 



IV. QUANTUM INVESTIGATIONS AT THE 
INTERNET CAFE 

We have seen that the Fraunhofer Quantum Computing Ser- 
vice is characterized by three decisive aspects: it is a web- 
based computing service, it combines a simulator with a col- 
laborative workspace, and it has a modular, extensible, and 
open software design. In the following let us discuss each as- 
pect and its potentials for future developments. 

The web-based design gives the simulator a ubiquitous 
availability. There is no need to install any software or to buy 
special hardware, the computing service makes quantum in- 
vestigations feasible everywhere - at the institute, at home or 
even at the Internet cafe. All simulation runs are stored at the 
server and can be accessed everywhere and anytime. Espe- 
cially this advantage could be a source of mistrust: the own 
ideas and investigations should stored on a foreign server? 
This sensitive aspect has to be justified by strong security pro- 
tocols and high privacy standards. Each simulation job has 
to be encoded and unambiguously signed by owner and date 
making the authorship pursuable. 

Closely connected to this point is the collaborative aspect of 
the portal. The central availability, publishing and discussion 
of the newest research data and documents is a key element 
of scientific success. The integrated content management sys- 
tem of the portal supports all appropriate forms of collabora- 
tive work like publishing with and without review, discussion 



groups, news, wiki systems etc. The quantum circuits and 
simulation results can be exchange by the uniform QML doc- 
ument format. 

An important feature of the QML-based data definition is 
its hypertext ability. QML files with well-tried quantum cir- 
cuits can be stored on any web server building a library of 
established modules of quantum algorithms. This modules 
can be included as sub-circuits in other QML files by a simple 
HTML reference. This facilitates the self-organized formation 
of a public library of quantum algorithms by the community. 

This community -based building of a library of quantum cir- 
cuits can be supplemented by an establishment of a network 
of simulation servers. The modular design of the simulation 
engines provides an easy way for integration of already used 
simulation software developed by other research groups. The 
requirements to integrate other simulators in our system are 
very low. The software has to be extended by a QML parser 
which will realize a common interface for all various simula- 
tors. Then, the new QML-capable engine could be run at our 
cluster providing a new computing service for the community. 
But it is also possible to install a small Java-server on the clus- 
ter of the partner group dispatching all simulation jobs which 
requesting the new engine. The user will not see this infras- 
tructure details, he will only note that the portal offers more 
and more simulation features which are provided by more and 
more research groups of the community. Currently we start 
a cooperation with the quantum computing group at the Uni- 
versity of Newcastle to integrate group-theoretical quantum 
algorithms. 

We hope that our simulator and workspace will become a 
useful tool for the community and we will get many user re- 
sponses and comments which help us to stabilize, improve and 
extent the system leading to the one objective of our intension: 
to support all efforts which brings the quantum computer from 
dream to reality. 
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